Noise Reduction by Maximum a Posteriori Spectral Amplitude Estimation with Supergaussian Speech Modeling
نویسندگان
چکیده
ESTIMATION WITH SUPERGAUSSIAN SPEECH MODELING Thomas Lotter and Peter Vary Institute of Communication Systems and Data Processing ( ) Aachen University (RWTH), Templergraben 55, D-52056 Aachen, Germany E-mail: lotter vary @ind.rwth-aachen.de ABSTRACT This contribution presents a spectral amplitude estimator for acoustical background noise suppression based on maximum a posteriori estimation and supergaussian statistical modeling of the speech DFT coefficients. The probability density function of the speech spectral amplitude is modeled with a simple parametric function, which allows a high approximation accuracy for Laplace or Gamma distributed real and imaginary parts of the speech DFT coefficients. Based on the approximation, a computationally efficient maximum a posteriori speech estimator is derived, which outperforms the Ephraim-Malah algorithm in a single channel noise reduction framework.
منابع مشابه
Speech Enhancement by MAP Spectral Amplitude Estimation Using a Super-Gaussian Speech Model
This contribution presents two spectral amplitude estimators for acoustical background noise suppression based on maximum a posteriori estimation and super-Gaussian statistical modelling of the speech DFT amplitudes. The probability density function of the speech spectral amplitude is modelled with a simple parametric function, which allows a high approximation accuracy for Laplaceor Gamma-dist...
متن کاملNoise Power Spectral Density Estimation based on Maximum a Posteriori and Generalized Gamma Distribution
Noise power spectral density (PSD) estimation is a crucial part of speech enhancement system due to its contributory effect on the quality of the noise reduced speech. A novel estimation method for color noise PSD on the basis of an assumption of generalized Gamma distribution and maximum a posteriori (MAP) criterion is proposed. In the experiment, generalized Gamma PDF which is a natural exten...
متن کاملSupergaussian Garch Models
In this paper, we introduce supergaussian generalized autoregressive conditional heteroscedasticity (GARCH) models for speech signals in the short-time Fourier transform (STFT) domain. We address the problem of speech enhancement, and show that estimating the variances of the STFT expansion coefficients based on GARCH models yields higher speech quality than by using the decision-directed metho...
متن کاملProceedings of Meetings on Acoustics
Estimation of the power spectral density (PSD) of noise is crucial for retrieving speech in a noisy environment. 3 novel methods for estimating the non-white noise PSD of noisy speech based on a generalized gamma distribution and 3 criterions are proposed, which are minimum mean square error (MMSE), maximum a posteriori (MAP) and Maximum likelihood estimation (MLE). Because of the highly non-st...
متن کاملSpeech spectral modeling and enhancement based on autoregressive conditional heteroscedasticity models
In this paper, we develop and evaluate speech enhancement algorithms, which are based on supergaussian generalized autoregressive conditional heteroscedasticity (GARCH) models in the short-time Fourier transform (STFT) domain. We consider three different statistical models, two fidelity criteria, and two approaches for the estimation of the variances of the STFT coefficients. The statistical mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003